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AN EXERCISE (?) IN FOURIER ANALYSIS ON THE 
HEISENBERG GROUP 


DANIEL BUMP, PERSI DIACONIS, ANGELA HICKS, LAURENT MICLO, 

AND HAROLD WIDOM 

Abstract. Let H(n) be the group of 3 x 3 uni-uppertriangular 
matrices with entries in Z/nZ, the integers mod n. We show that 
the simple random walk converges to the uniform distribution in 
order n 2 steps. The argument uses Fourier analysis and is surpris¬ 
ingly challenging. It introduces novel techniques for bounding the 
spectrum which are useful for a variety of walks on a variety of 
groups. 


February 17, 2015 

1. Introduction 

For a positive integer n, let H(n) be the “Heisenberg group mod n” 
the group of 3 x 3 uni-uppertriangular matrices with entries in Z/nZ: 

A 

y x,y, z G Z/nZ. 

V 

Write such a matrix as (. x,y,z ), so 

(x,y,z)(x',y',z') = (x + x', y + y\ z + z! + xy). 

A minimal symmetric generating set is 

(1) S = {( 1, 0, 0), (-1, 0,0), (0,1, 0), (0, -1, 0)} 

A random walk proceeds, starting at the identity (0,0,0), by repeat¬ 
edly choosing an element from S with probability | and multiplying. 
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More formally, define 

< 2 > «<»> = {o ^otherwise 

For g G H(n), 

Q * Q(g ) = QV‘)Q(gh~ l ), Q’ k (g) = J2 QWQ^W')- 

h h 

If n is odd, high convolution powers of Q converge to the uniform distri¬ 
bution U(g ) = ^ 3 , Q* k (g ) — > U(g). Measure the speed of convergence 
by the total variation distance 

- U\\ = max | Q' k (A) - U(A)\ = i ^ \Q ,k (g) - U(g)\. 

g£H 

One purpose of this paper is to give a new proof of the following theo¬ 
rem. 


Theorem 1. ForQ defined by |I|] on H{n), there are universal positive 
constants A and C such that 

. —27r^/e ,, j —27r^fc 

yle n 2 < || Q* — I^||tu ^ Ce~n? 
for all odd n and all k. 


We say “order n 2 steps are necessary and sufficient for convergence.” 
Theorem [I] is proved by Fourier analysis on H(n). This natural ap¬ 
proach is known to be challenging. (See Section i) As explained in 
Section [3j the Fourier proof involves bounding the spectrum of the nxn 
matrices (where the unmarked entries are taken to be zero): 


( 3 ) 


M(0 



1 


\ 



0 < j < m — 1 


V 


1 





These same matrices come up in a variety of solid state physics and 
ergodic theory problems as ‘Harper’s Operator,’ ‘Hofstader’s Butterfly,’ 
and the ‘Ten Martini’s Problem’ discussed in Section [2] They further 
occur in the engineering literature on the fast Fourier transform and 
for other random walks on the meta-abelian groups. 

Section [2] is a literature review, including the parallel problem of a 
random walk on H(R) and H( Z). It describes the physics problems, 
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the connections with the fast Fourier transform, and connections to 
Levy’s area. Section [3] gives background on Fourier analysis on H{n). 
Theorem [T] is proved there assuming our eigenvalue bounds (which are 
proved later). It also gives a variety of other walks on other groups 
where matrices M(£) (and thus our bounds) appear. 

Eigenvalue bounds are given in Section |4} Two new techniques 
are developed. The first uses geometric path methods, developed for 
Dirichlet eigenvalues of stochastic matrices to give upper bounds for 
the largest eigenvalue of M(£); note that some entries of M(£) are 
negative and the rows don’t sum to a constant. The second is a new 
technique for lower bounds on smallest eigenvalues. The bounds are 
rough (but more than adequate for the task) and work generally. 

Acknowledgment. We have benefited from useful comments and tech¬ 
nical help from Laurent Saloff-Coste, Marty Isaacs, Je-Way Zhou, Steven 
Kivelson, Steve Evans, Mehrdad Shahshahani, Thomas Strohmer, Yi 
Zhang, Akash Maharaj, and Arun Ram. Also, the first, second, third, 
and fifth authors would like to acknowledge partial support from NSF 
grants DMS 1001079, DMS 08-04324, DMS 1303761, and DMS 1400248 
(respectively). The remaining author would like to acknowledge partial 
support from ANR grant number ANR-12-BS01-0019. 


2. Background and References 


For background on Markov chains and random walks on groups, a 
good introduction is [28] . The Fourier approach is developed in [13] . 
A variety of analytic approaches are described in [34] and [35]. The 
Heisenberg group is ubiquitous in mathematics and physics through its 
connections to the quantum harmonic oscillator HQ, Harmonic analy¬ 
sis [23], theta functions tra, ro, and number theory [TO] . Analysis 
and geometry on the Heisenberg group have recently been important 
in finding counterexamples to the Goemans-Linial conjecture in com¬ 
plexity theory m When p is prime, the groups H(p) are building 
blocks of the extra special p-groups (groups with center, commutator 
and derived group equal to Z/pZ). See [24] and [ 40] . 

The literature on random walks on the Heisenberg group is reviewed 
the connection to fast Fourier transform is treated in |2.2 the 


m 


2.1 


solid state connection is reviewed in |2.3[ and the connection to Levy’s 
area is given in |2.3 


Section 2.5 describes two companion papers. The 


first generalizes to other groups; the second gives precise asymptotics 
for the top and bottom eigenvalues of M(£). 











4 


BUMP, DIACONIS, HICKS, MICLO, AND WIDOM 


2.1. Random Walk on H{ri). The probabilistic study of a random 
walk on H{n ) was started by Zack ([H]), who gave an interpretation 
of the walk in terms of random number generation. Theorem [T| was 
first proved in joint work with Saloff-Coste (see [13], [IS]. DU) using 
‘geometric theory.’ The first proof used Nash inequalities, the second 
proof uses moderate growth, showing that a similar proof holds for 
any sequence of finite nilpotent groups with bounded class and number 
of generators: (diameter) 2 steps are necessary and sufficient for con¬ 
vergence. (The diameter of H{n) with generating set S is of order n.) 
The third proof of Theorem [I] lifts to a random walk on a free nilpotent 
group (here H(h)). Then results of Herbish-Saloff-Coste give bounds 
on the decay rate of convolution powers for the lifted walk. These are 
transferred down to the finite quotient using Harnack inequalities. The 
beautiful work of Alexopoulos m) should allow similar results for less 
symmetric walks on quotients of groups with polynomial growth. 

A fourth proof appears in [TO]. This completes the generating set 
S' to a union of conjugacy classes. Character theory is used to bound 
the conjugacy walk. Finally, comparison theory is used to go from the 
conjugacy walk to the walk based on S. Richard Stong (E3, M) 
has results which specialize to three yet different proofs. His approach, 
using ‘twisted random walks’ works for other classes of groups, such as 
the Burnside groups B(n, 3). It deserves to be developed further. 

A further set of approaches/results is worth mentioning. A simple 
random walk on n x n uni-uppertriangular matrices with coefficients 
mod p has also been studied in a series of papers. In this work, the 
emphasis is on large n and fixed p behavior but some of the results 
specialize well to n — 3. We refer to the definitive paper by Peres and 
Sly [32] which surveys earlier literature. 

Curiously none of the papers cited above carries out the program at¬ 
tempted here—direct use of Fourier analysis. All we can say is, having 
pushed it through, we now see why. 

There has also been work on a random walk on the Heisenberg group 
H{R). This can be developed as a special case of a random walk on 
Lie groups—see Breuillard jd] for a useful survey. The paper [7] focuses 
on H{R) and proves a local limit theorem using Fourier analysis. Of 
course, this is close in spirit to the present effort. Alexopoulos [I] 
had remarkable decay rates for general discrete groups of polynomial 
growth. 

A natural problem (open to our knowledge) interpolating between 
the finite and and infinite case would be to study the simple random 
walk on H( Z) with generating set S' of ([Tj) . Although Breuillard allows 
discrete measures on H(R), he must rule out measures supported on 
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subgroups. Indeed, as far as we know, the irreducible unitary repre¬ 
sentations of H{ Z) have not been classified (and seem to give rise to a 
‘wild’ problem). 

If Q is the measure supported on S (considered on H{ Z)), the asso¬ 
ciated random walk is transient. It is natural to study the decay rate 
of Q* k . In particular we conjecture that 


(4) 


Q 


*k 


c 4r 2 (4) 

'id) ~ — with c = -414 

' ' k 2 7T 2 


A justification for the conjectured constant appears in Section 2.4 be¬ 
low. 

Notice that Q could be studied by considering Q from ([2]) on H(n) 
for n » k 2 (when there is no possibility of the walk wrapping around.) 
Indeed iL(nZ) is a normal subgroup of H{%) and all of the irreducible 
unitary representations of iL(Z/nZ) described in 3.2 give rise to irre¬ 
ducible unitary representations of H( Z) via 


H{ Z) -A H(Z)/H(nZ) = H{Z/nZ) -A GL{V) 


where V is the vector space underlying the representation of iL(Z/nZ). 
Those quotients are all of the hnite dimensional simple representations 
of H(7i). (See [31].) The usual infinite dimensional irreducible unitary 
representations of H{R) restricted to H{ Z) are also irreducible but alas 
there are others which do not seem to be known at this writing. 


2.2. The Discrete Fourier Connection. For / : Z/nZ -a- C, the 

discrete Fourier transform (DFT) takes / —y f with 


71—1 


h*o = -/=£AO 

\/n ‘f-' 


2ni£ i k 


5=o 

This is a mainstay of scientific computing because of the fast algorithms 
(FFT). There is a close connection between the discrete Heisenberg 
groups H{n ) and these transforms. This is charmingly exposited in 
Auslander and Tolimieri ([3]). Let the matrix associated with the DFT 

2-nijk 

be T n , with (J r n )j,k — e J n ■ Recall the matrix of the Fourier transform 
M( 1) at ([3]) . It is straightforward to see that T n commutes with M( 1): 

T~ X M{\)7 n = M{1). 


This implies that these Hermitian matrices can be simultaneously diag¬ 
onalized. As explained in [19] and [30], there is engineering interest in 
diagonalizing T n . We are interested in diagonalizing M( 1) and hoped 
information about T n would help. 
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Alas, IF n A = id, so the eigenvalues are ±1, ±i. It is known that the 
associated eigenspaces have dimension (within one of) |.([T9j) This 
does not give a useful reduction for our problem. Going the other way, 
Dickinson and Steiglitz [T9j looked for matrices commuting with T n 
and suggested M( 1)(!). Mehta ]3_Q] found a basis for eigenfunctions of 
T n as ‘periodicized Hermite functions’. These are tantalizingly close to 
the approximate eigenfunctions for M( 1) that we hnd in [9]. We are 
sure there is more to understand on this front. 

2.3. Harpers Operator. In trying to understand the band structure 
of the x-ray diffraction patterns of various solids (particularly metals),a 
variety of model problems involving Schrodinger operators with peri¬ 
odic potentials have been introduced and studied. Perhaps the most 
widely studied model is Harper’s model on £ 2 (Z) with Hamiltonian 

He,<j>fi{n) = £(ra + 1) + £(n - 1) + 2cos(27r(n6 ) + 0))£(n), 

n G Z, 9, (p G [0,1]. The spectrum of Hg^ is pictured as “Hofstadter’s 
Butterfly” with its own Wikipedia page. With 2 cos replaced by A cos, 
the operator is often called the almost Mathieu operator (with its own 
Wikipedia page). For 6 irrational, the norm of Hg^ is known not to 
depend on 0. Our operator M(£) is a discretized version of Ht 0 which 

n ’ 

is often used to get numerical approximations to the spectrum. It has 
also been used to prove things, see the beautiful account in Section 3 of 
Beguin-Valette-Zuk PJ. They give bounds on the largest and smallest 
eigenvalues of M(£) which we are able to sharpen considerably in some 
cases. 

Any periodic Schrodinger operator has band structure in its spec¬ 
trum because of Bloch’s Theorem [21] . Determining the fine structure 
of the spectrum of Hg^ has generated a large literature well surveyed 
in [2B], One of the signature problems here was dubbed the “Ten 
Martini’s problem” by Barry Simon. It has recently been completely 
resolved by work of Last [26], and Avila and Jitomirskaya (pi]). 

There is considerable interest in a variety of similar operators, e.g. 
in M(£), replace cos with cos(^^) +cos(^p). See Zhang 

et al pE] and their references. We have tried to develop our bounds 
on the spectrum of M(£) (in Section [4]) so that it applies to these 
generalizations. 

2.4. Levy Area Heuristics. The random walk on H{ Z) with generat¬ 
ing set S — {(1, 0,0), (—1, 0, 0), (0,1, 0), (0, —1, 0)} leads to interesting 
probability theory which gives a useful heuristic picture of the behavior 
on H(n) as well. This section gives the heuristics; mathematical details 
will appear in a later publication. 
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Let the successive steps of the walk be 

(ei, <5i, 0), (e 2 , $2)0 ),..., (e n , S n , 0), • • ■ e S. 

By a simple direct computation, ( e n , $ n , 0)... (ei, $i, 0) = ( X n , Y nj Z n ) 
with 

x n — + ...e„, = $i + • • • + $ n , 

= G$1 + C3($l + $ 2 ) + • • • + ^n($l + ' ' ' + $n-l)- 
Thus the X n and Y n coordinates perform simple random walks on Z 
(P(ej = 1) = P(e* = —1) = |) and (P(ej = 0) = |). The Local Cen¬ 
tral Limit Theorem shows 


P{X n =j} 


g 2 y cry/n 


V2 


TTa 2 n 


with = -. 


The same result holds for Y n and indeed (X n , P n ) have independent 
Gaussian approximations. In particular, 

p{(x n ,y„) = (o,o)}~—. 

7 rn 

Next consider Z n . This is a martingale and a straightforward appli¬ 
cation of the martingale central limit theorem shows that 




Bi(s)dB 2 (s) 


1 

with P(£) the distribution function of 

If 1 

2 Jo 

with Bi standard Brownian motion. We note that B\[t)dB 2 [t) — 
J^B 2 (t)dB 1 (t) is Levy’s area integral, the distribution of the signed 
area between Brownian motion and its chord at time 1. 

In turn, 

[ Bi(s)dB 2 (s) 

Jo 

can be shown to have the density 




7T 2 


-OO < £ < OO 


See Harkness and Harkness csa for background and details. 

If ^=, ^=, ^ are asymptotically independent, this would suggest 
that 

c 4T 2 (-) 

P{{X n , Y n , Z n ) = (0, 0,0)}-j, c=- 

n z n z 


rv-/ 
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Note that Breuillard [f] has a local limit theorem of a similar sort for 
a non-lattice walk on H(R) with a less well identified constant. We 
conjecture that this is onr c. A further heuristic conclusion which this 
paper makes rigorous: the walk on H( Z) can be mapped to the walk 
on H(rn) by considering the coordinates mod m. This familiar trick 
works, for example, for AA(mod m), showing that the simple random 
walk mod (m) requires order m 2 steps to get close to uniform. The 
same argument works for the joint distribution of (X n ,Y n ). The proof 
for these cases requires something like the Berry-Esseen Theorem. See 
Diaconis [15] for details for X„(mod m ) and [Ij for details on H(n). 
The limit theorem for — suggests that order m steps are necessary 
and sufficient. This suggest that order m 2 steps are necessary and 
sufficient for randomness on H(m). This is exactly what Theorem [I] 
proves. 

This aspect of ‘features’ of a Markov chain getting to their stationary 
distribution at different rates is important in practice. A simulation is 
often run with just a few features of interest and the relaxation time 
for the whole state space may be deceptive. See [2] for references to 
a handful of cases where this can be carefully proved. Generalizing, 
if U n {m) is the group of n x n uni-uppertriangular matrices with co¬ 
efficients mod m, let a simple random walk be determined by picking 
one of the n — 1 coordinates just above the diagonal and placing ±1 
there (with the rest zero.) We conjecture that coordinates in the first 
diagonal require order m 2 step; in the second diagonal, order m steps; 
in the third diagonal, order ms steps; with order m 2//fc steps for the kth 
diagonal. Here n is fixed, 1 < k < n — 1 and m large are the cases of 
interest. 


2.5. Future Developments. The present paper has two companion 
papers. The first [H] abstracts the present results to a larger class 
of walks and groups. The second gets much more precise bounds on 
the extreme eigenvalues of the matrices M(£). Since essentially these 
matrices come up for the generalizations, present techniques and the 
refinements can be used to give satisfactory analyses of the generaliza¬ 
tions. 

Let G be a finite group with a normal abelian subgroup A and a 
subgroup B such that G = BA, B D A = id. Thus G = B ix A is a 
semi-direct product. Let cq,... ,a s be a symmetric generating set for 
A ; b\,..., b t a symmetric generating set for B. Then {a i} bj} generate 
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G. Consider the probability measure (with 0 < 9 < 1 fixed): 


Q(g) 



if g e {ai,... ,a s } 
if g e { 6 i, ■■■A] ■ 

otherwise 


The representation theory of semi-direct products is well known. See 
Serre Chapter 7 ([36j). The irreducible representations are determined 
by the representations of B and by inducing the characters of A up to 
G. Let x G A. If p x (b,a ) is the induced representation (a \B\ x \B\ 
matrix), 

Q(Px) = dp B + (1 - 0)D, 

with Pb the transition matrix of the 5-walk (based on a uniform choice 
of bj) and D a diagonal matrix with 6 th element 

Dip A = - ^ X (p~ l aib). 

s t T 


This is in sharp parallel with the present development and we have 
found that the techniques of the present Section [4] suffice for successful 
analyses. Examples from [ 8 j include: 

• The random walk on H(n) generated by (x(i), 0, 0) and (0, y (j ), 0) 

for (perhaps non symmetric) generating sets 1 <j< 

J chosen with some (perhaps non-uniform) probabilities. 

• The d-dimensional Heisenberg groups G = (x,y,z), x,y 6 
(Z/nZ) d , x G (Z/nZ). 

• The affine groups: G = (a, b), 6 G Z/nZ, a G (Z/nZ)* 

These and further examples are treated in [ 8 ]. That paper also develops 
a useful supercharacter theory which allows the comparison approach 
H3 .[16] to be used for this class of examples. The second compan¬ 
ion paper gives sharp bounds on the eigenvalues and eigenvectors of 
matrices M(£), for certian choices of £ and dimension m. For ease of 
exposition, work with M( 1). It is shown that the kth largest eigen¬ 
value of M( 1) is equal to 1 — ^ + o (^-) where yk is the kth smallest 
eigenvalue of the harmonic oscillator 


L 


1 d 2 

2 dx 2 


+ 2tc 2 x 2 . 
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on (— 00 , 00 ) (so Hk = (2k + l)7r), 0 < k < 00 fixed. The corresponding 
eigenfunctions of L are well known to be Hermite functions; with a = n, 

Mx) = e ~ ax2 

4>i(x) = —4 axe~ ax2 


d n d 

Mx) = e ax "— (e~ 2ax2 ) = e ax2 —e~ ax2 Mi(x) 
dx n ax 

It is also shown that wrapped versions of these are approximate eigen¬ 
functions of M( 1). Numerical comparisons, not reported here, show 
that these approximate eigenfunctions are very accurate for fixed k 
and large n. 

The present technique gives cruder results which are all that we need 
and are available for all £. 

3. Representation Theory 

This section develops the tools needed for representation theory: 
Fourier inversion, the Plancherel Theorem, representations of H(n ), 
and a formula for the distribution of central elements under convo¬ 
lution powers. Theorem [T| is proved when n is prime assuming the 
eigenvalue bounds in the following Section [4j For background, see [ 36] 
and [15]. 


3.1. Basic Formulas. Let G be a finite group with G the irreducible 
unitary representations. For Q a probability on G and p G G, define 
the Fourier Transform: 


Q(p) = ^2Q(g)p(g)- 

9 

As usual, Fourier transforms take convolution into products: Q * Q(p) = 
( 1 Q(p )) 2 . For the uniform distribution U(g) = py, Schur’s lemma im¬ 
plies that U(p) = 0 for all nontrivial p G G. The Fourier Inversion 
Theorem shows how to reconstruct Q from Q(p): 

Q(g) = j4^^tr(Q(p)p(5)’). 

p&G 
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The basic Upper Bound Lemma ( see |T6] page 24) gives a bound on 
the distance to stationary using the Fourier Transform: 

4||V , ‘-0I™<£AN‘MII 2 ' 

peG 

In these sums, d p is the dimension of p, ||M|| 2 = tr(MM*) and p — 1 
is the trivial representation. 


3.2. Representation Theory of H(n). A neat elementary descrip¬ 
tion of the representations of H{n) appears in |20j . We find the follow¬ 
ing self contained derivation more useful for our application. For every 
divisor m of n, 1 < m < n , there are (—) </>(m) distinct irreducible 
representations of degree m, where (f>{yn) is the Euler phi function. 
Since 



this is a complete set. 

For the construction, fix m\n. Let 


V = {f : Z/mZ C} . 

Now fix a, b e Z/(^Z) and c e ZjmZ with (c, m) = 1. Let q m = 
q n - e 2m t n . Define y, z) : V -f V by 

pa,„Ax, y, z)f(j) = e + N;? J+ 2 , /(j+ 1 ). 


In these formulas, regard Z/(—Z) and Z/(mZ) as subgroups of Z/nZ 
by sending 1 to m (resp. n/m .) Using (x,y, z)(x'y'z') = (x + x',y + 
2 / r , 2 + z 1 + a;?/ 7 ), it is easy to check that p a ,&, c is a representation: 


Pa,&,c(u V , S/', = Pa,b,c{ X i V, z)q^f +W3+zf) f {j + x') 

— g«U+3:')+ 6 (2/+2/ , )gC(’/'0'+ a; )+ 2: '+J/t+2) _)_ 2-') 

= Pa,6,c(^ + x', J/ + J/', 2 + z! + Xy')f(j). 


To compute the character, note that the trace of p a ,b,c(x, y, z) is zero 
unless m\x. In this case, the trace is qn X+by Qm J2j =o I'm ■ This sum is 
zero unless m\y (when it is m). Thus: 


Xa,bA x ,y, z ) 


0 unless m\x and m\y 

qn X+by Qm m if m\x, y 

Distinctness and irreducibility can be determined by computing: 


U<,AclX«',6V> = A E = <ww. 

n 6 


n,j2,z 
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In the sum, j i, j 2 run over 0,1,, n/m — 1 and z runs over 0,1,..., n — 
1. Since a, b can take any values in 0,1,..., — — 1 and c runs over num¬ 
bers relatively prime to m, this makes (—) 2 0(m) distinct irreducible 
characters of dimension m. Summing over divisors of n accounts for 
all of the characters. 


Example 1. When m — 1, this gives the n 2 one dimensional characters 
Xa,b(x,y, z) = q° r ' x+by . Note that q\ — 1 so that c doesn’t occur.) 


Example 2. When m — 2, set 

8(x) = 


1 if a; is even 
0 if x is odd., 


8(x) S(x) 


with 8(x) — 1 — 8(x). 

S{x)qyJ 

Note that tr (p atb i(x,y, z)) agrees with Xa,b,i above. 
Example 3. Consider Q(a, b, c ) for 


2(1,0,0) = Q(—1,0,0) = 2(0,1,0) = 2(0,-1,0) = 

When m = 1, Q(a, b) = \ cos (^) + \ cos (^) for 0 < a, b < n — 1. 
When m = 2 (so n is even) 


Q(a,b, 1 ) = ^ 

In general, when m > 3, 

/ 


COS 1 

(2nb\ 

\ n J 

cos ( 

2na \ 

cos I 

(2na \ 

K n ) 

V 

— cos 

n J 

ml 


2(o, b, c) = - 


q a 

■. in 


<fn 



QnQm + Q-. 


~b n ~jc 
dm. 


0 < j < m — 1 


y 'in 

Note that conjugating Q(a, b, c ) by a diagonal matrix with 1, q 2a ,... 

on the diagonal results in a matrix with ones on the super and subdi¬ 
agonals (and q~ ma , q^ a in the two corners). 
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Example 4. When n is a prime p , there are p 1 one-dimensional rep¬ 
resentations and p — 1 p-dimensional representations, with Q(0, 0 , c) 
conjugate to M(c) of the introduction. 


3.3. Proof of Theorem [l] when n = p is prime. From the Up¬ 


per Bound Lemma stated in Section 3.1 and the formulas for Q{p) in 


Section 3.2 


4|| Q' k - Ufry < )T 

(a,6)^(0,0) 


1 

-COS 


2na 

p 


+ -COS 


2nb 

V 


2 k 


\k || 2 


= / + // 


P~ 1 

+£p||v«) 

?=1 

To understand /, hrst consider a = 0 and b = 1. This term is 


1 1 /2tt\ 

—I— cos — 

2 2 \p J 


■2k 


71 


= 1 -- + 0 - 


p* 


p 


2k 


This term is of size e Bv if k = pp 2 with B = 2tt 2 . A standard analysis, 
carried out in detail in [TBj shows that the other terms in (I) can be 
similarly bounded and that the sum of these terms is at most C'e~ 47r “ ,? 
for a universal constant C , independent of p and k. Consider the 
next sum (II). The matrix M(£) is symmetric with real eigenvalues 
1 > /MO > • • > /MO > —1. In Section [4] it is shown that for 
0 < f 


log(p) ’ 


M(0 = max(/3i(0, —^p(O) < 1 - 


9 


p 


,4/3 


for a universal constant 9. Also, for . p , . < £ < § it is shown that 

’ log(p) — s 2’ 




Furthermore note that 


M(0 = M(p-0- 


Indeed, as mentioned in Section |2.5[ much sharper bounds are found. 
These are not relevant for this proof. Using these bounds with 9 = 
min{ 0 , f}, 


II <p 3 


V 2k 


>4/3 


p 


+ p 


9 \ 


2k 


log ip)) 


This is exponentially smaller than the bound for /, completing the 
proof of the upper bound. 
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A lower bound for the total variation follows from a lower bound 
for the total variation convergence of the ( 1 , 2 ) coordinate X k to the 
uniform distribution on Z/pZ. As explained in Section 2.3, X k evolves 


as a simple random walk on Z/pZ and a lower bound showing that 
k = dp 2 steps is not sufficient for a fixed 6 is well known. See [15] for 
several different proofs. This completes the proof. 

We will not carry out the details of the proof of Theorem [l] in the case 
of general n. Bounds for the various transforms Q(a, b, c ) are derived 
in Section [4] No new ideas are needed. 


3.4. A Formula for the Center. Let n = p be an odd prime. Let Zk 
be the value of the central element after k steps of the walk. A formula 
for P{Zk = z} can be derived from the Fourier Inversion Theorem. We 
do not see how to use this to derive useful estimates but record it here 
in case someone else does. 


Proposition 1. For a simple random walk on H(p) with p an odd 
prime, all k > 0, and z E Z/pZ, 


1 1 P_1 
P{Z t = z] = - + 

p rr 
1 F £=1 


— 2ni£z 

e p 


3 = 1 


Proof. From the Fourier Inversion Theorem, for any (x, y, z) 

P(X k = x, Y k = y,Z k = z) = 4 d p Tr {(Q(p)) k p((x, y, z ) -1 )) . 

P ped 

Sum over x, y and bring this sum inside the trace. The terms for the 
1-dimensional representations are easily seen to vanish except of course 
for the trivial representation which contributes p 2 /p 3 = 1/p. For the 
p dimensional representations, writing p^ for Po,o,£> p^((x, y, z)^ 1 ) = 
p^(—x, —y, —z + xy). In the basis of Section 

Pe(-z, ~y, ~z + xy)f(j) = e ^ { ~ yj ~ z+xy) f(j - x). 

Fixing x and summing over y, e p (i~ x ) y _ Then, 

— 27 vi£z 

pd~ x , -Vi ~ z + x v)fU ) = P e p /(o). 

y 

Summing over x gives 


2.3 


_ X 

Pt(~ x , -V, ~z + xy) = pe~ 


'1 0 

1 0 


O' 

0 


Multiplying by Q(pf) k and taking the trace gives the result. 


□ 
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Remark. Of course, the elements in the first row of Q k (p |) tend to 
zero, but it does not seem simple to get quantitative bounds. 


4. Eigenvalue Bounds 


This section gives upper and lower bounds for the eigenvalues of the 
matrices M(£) of (j3j). Recall that these are n x n, with | in the upper 
right and lower left corners and ^cos(^p) for 0 < j < n — 1 on the 
main diagonal. These matrices have some symmetry properties, noted 
in Section |4.1[ which simplifies the job. Section |4~2 gives upper bounds 

for the largest eigenvalue of the form 1 — for 9 an absolute 

positive constant and £ << n. The argument makes novel use of 
bounds for Dirichlet eigenvalues using geometric path techniques. It is 
useful for more general diagonal entries and less structured matrices. 
A different variant bounds the largest eigenvalues for larger £. Section 
4~3] gives lower bounds for the smallest eigenvalues. These use quite 


different ideas: Comparisons with matrices constructed to have ‘nice’ 
eigenvalues. 

The following series of figures shows numerical examples. Figure [T| 
shows the top eigenvalues /3 i(£) of M(£) when n = 150. As £ varies 
from 1 , 2 ,... for small £, the top eigenvalues are close to 1 and fall 
off as £ increases but they are not monotone. The evident symmetry 
1 3\{c ) = /3 1 (n — c ) is proved in Section |4~Tj Figure [2] shows the smallest 
eigenvalues of M(£). While these appear as a mirror image of Figure 
[lj a closer look at the numbers shows this is only an approximation 
and the same proof method does not work. Figure [3] shows all of the 
eigenvalues of M(l) for n = 150. 


4.1. Symmetries and Inclusions. While many of the properties proved 
in this section are quite robust, the following more special properties 
are needed. For positive integers n, £ and any real a, let -D n ,£,a denote 
the n x n diagonal matrix with diagonal entries: 

D nia = cos {~^ a + •?')) 0 < 3 < n - 1. 

Let M n ^ tCt = \{D n ^ )a + V)) where V is the transition matrix of a simple 
random walk on TL/nTL (| on the diagonals just above and below the 
main diagonal and | in the upper right and lower left corners.) Thus 
M n £,o = M(£). Let 5(n, £, a) be the spectrum of M n ^ a . 

Proposition 2. (a) S(n, £, 0) = S(n, n — £, 0) 

(b) For all positive integer k, S(n,£,a) C 5(fcn, fc£, a) 
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The highest eigenvalue of M(Q when n = 150 



0 20 40 60 80 100 120 140 


FIGURE 1. The top eigenvalues of M(£) 


The lowest eigenvalue of M(£) when n = 150 



0 20 40 60 80 100 120 140 


Figure 2. The lowest eigenvalues of M(£) 


(c) If n is even, S(n,f,a) = —S yn,£,a+ ^J 

(d) If n is odd, S(n,f,a) C —S (^2n,2f, a + 

Proof. For 0’ 


cos 


2n(n — 
n 



cos 





Thus M(n, 0) = M{n,n — 0). For (0, if 0 is an eigenvector of 

M n ^ a then juxtaposing it k times gives an eigenvector of M kn k ^ a with 
the same eigenvalue because of periodicity. For Q, suppose if is an 
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The eigenvalues of M( 1) when n = 150 



0 20 40 60 80 100 120 140 


Figure 3. All the eigenvalues of M( 1) 


eigenvector of M n ^ a with associated eigenvalue 6. Thus: 


2 < 


a + j)) V’O') = WU) 


tOC? + 1) + V’O' - !)) + 1/2 COS 
4 \ n 

Set <f>(j) = (-1) j Xj). Then 

+ 1) + 0(j - 1)) - ^ cos + j) ) 0(i) = 


but -cos(^(a + j) = cos ((a + j + 

For properties Q and Q imply that 

S(n,£,a) C S(2n,2£,a) = -S ^2n,2f,a + ^0 . 


□ 


Remark. The operator with Is above and below the diagonal (and in 
the corners) and V cos (—(j + a)) with V > 0 a constant is a discrete 
version of the “almost Mathieu operator” of solid state physics. See 
|42j . The discrete operator corresponds to 4M(n, £, a) when V = 2. 
This is well known to be the critical case in the theory. The £’s of 
physical interest scale as O(n), which corresponds to the case when 
Z n is replaced by Z. When V > 2 the eigenfunctions are localized. 
For V < 2, they are spread out. The spectrum when V = 2 is called 
“Hofstadter’s Butterfly.” Its spectral properties have recently been ob¬ 
served in graphene j33]. All our ratios are rational. It is a fascinating 
question if the limit as ^ tends to an irrational has limiting spectrum 
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matching the spectrum of Harper’s operator on £ 2 . See 321 for further 
discussion. 


4.2. Dirichlet Path Arguments. For general n, M(£) = Q(0, 0, £) 
is an nxn matrix with {cos(^yp) on the main diagonal, | on the 
super and subdiagonals and in the upper right and lower left corners. 
Then \l + M(£) has non-negative entries with row sums in [|, |], Fi¬ 
nally, = |/ + |Af(0 is a substochastic matrix with | on the super 
and subdiagonals (and the corners) and | (l + cos GD) on the main 
diagonal for 0 < j < n — 1. The row sums are between | and 1. Form 
an (n + 1 ) x (n + 1 ) stochastic matrix K^ by adding an absorbing state 
oo, with transitions from j to oo, K^(j, oo) = | (l — cos (^yp))- Set 
K(oo,oo) = 1. Bounds on the eigenvalues of L^ (and so M(£)) give 
bounds on the Dirichlet eigenvalues of K^. Use the notation of [12] 
Section 4.2 with S = {0,1,..., n — 1}, S = S U {oo}. The Markov 
chain A/ restricted to S is connected. Let U(j) — 1/n, 0 < j < n — 1 
with 

£ 2 (U) — {/ : {0,1,..., 7i — 1} —>■ M} 

having inner product (f\g)u — ^jes fti)dU)U(j)- As required, for all 
x,y G S, U(x)K^(x,y) = U(y)K^(y,x). When needed, functions / on 
S are extended to S as /(oo) = 0. (Also set f/(oo) = 0.) Define a 
Dirichlet form £ on £ 2 {U) by 

A/I/) = \ E (/(*) - f(y)) 2 U(x)K ( (x,y). 

x,y£S 


Let f3 = /?(£) be the largest eigenvalue of L^. From [T 2 j Lemma 19, if 
there is a constant A > 0 such that \\f\\\, < A||£(/)||, for all / G £ 2 (U), 
then (3 < 1 — -4. This shows that the largest eigenvalue of M(£) is at 
most 1 — 

Geometric arguments require a choice of paths 7 x starting at x G S 
to 00 , with steps which are possible with respect to A/. Thus 7 ^ = 
(x 0 = x,Xi,X2 ,... ,Xd = 00 ) with K^(xi,x i+ i) > 0 for 0 < i < d — 1 . 
Let d — 7 / denote the length of this path. From [T2] Proposition 2c, 


(5) 


(3 < 1 —- with A = max 


A 


xes, y es K ( (x,y) 


Y It* 


zes, (x,y)e^ z 


The bounds are better if A is ‘small.’ From (|5]) this happens if 
both the paths are chosen so that no edge occurs too often and the 
(x, y) edges that appear don’t have small value of K^(x, y). With these 
preliminaries we next prove: 
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FIGURE 4. Splitting points into groups according to the 
cycles of cosine. Note the values marked x* t and x*. 


Proposition 3. There exists a positive constant 9 such that for all 
n and 1 < f < the largest eigenvalue /3(f) of M(f) satisfies 


m < i - o (h 4/3 - 


Proposition 4. For 


I°g(n) 


< £ < n 
— s — 2’ 




Proof of Proposition [3| For 1 < f < , let x* 



. Break the 


f cycles of cos(^p) at |_|J, 2|_|J,... into f groups, eg. when f = 3 
breaking ties to the right. See Figure lil Each group has | + 0(1) 
points. For each group, define a point x/ by counting x* points from 
the left and x* u by counting x* from the right of the groups. Each group 
is treated identically and we here focus on the first group. From the 
development above, for any j between x\ and 2xj/, 


K^(j, oo) 




1 I c ( 2<X * 1 

2 { n In 


Thus K ^. ^ < 6 J (1 + o(l)) for an explicit constant 9, in¬ 

dependent of f,j,n. For points r between 1 and x\ — 1, connect r to 
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oo 



FIGURE 5. The chosen path from 1 to oo is given with 
solid lines, while the chosen path from 2 to oo is pictured 
with dashed lines. 


infinity by horizontal paths r, r + 1 ,..., (r + x] + 1 ) going directly from 
there to oo. See Figure |5j 

Do the symmetric thing with the rightmost x* — 1 points at the right 
of group 1 , (moving leftward.) Connect points from x* to x* u directly to 
oo with paths of length 1. Treat each of the other groups in a parallel 
fashion. 

Bounds for A in (|5]) follow from bounds from the edges in Group 1. 
For the edge (r — 1, r), order x* paths use this edge and each such path 
has length x*. Since r , , 2 ^ f , —jt = 12 , the contribution from 

this edge is at most of order ( x *) 2 . Next consider edges (j, oo) with j 
between x\ and 2 x z * or between x* u — x* and x* u . At most two paths 
use such an edge, one of length x* (and the other of length one.) The 

contribution here is of order x*. The exponent | in our choices 

of x* was chosen to make (x *) 2 and A of the same order. The 
only other edges used are (j, oo) with j from 2x* t to 2x*. These are 

\ 2/3 


used once for paths of length 1 and i s most of order ^|J 

Combining bounds completes the proof of Proposition |3j □ 

Proof of Proposition [7} Proceeding as above, break 0 < j < n — 1 into 
groups of | points (within 1 ). Again, the groups are treated identically 

and we focus on the first group. Let x z * = \ | and connect points of 

0 < j < x\ to points x] +j and then to oo. Since cos (^£(x* + j)) < 0, 
Kc(x*+j oo) — 6 - F° r x*i to x*, connect points directly to oo. For j at 
the right of group 1 , connect backward to x* — j and then to oo in 
symmetric fashion. Now, < 6. For all x\ < j < x* that occur in 

this choice of paths, these edges (from j to oo) have at most 3 paths 
through them, the longest path of length of order This gives 5| 
as an upper bound for those edges. For edges (j,j + 1), K ^- +1) = 12. 
The maximizing such edge is (x* — 1, Xj). This has x\ paths using it of 
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FIGURE 6. Diagonal elements for £ = 1 


length x* contributing at most 12(x *) 2 = | . Combining bounds 

completes the proof of Proposition [4j □ 

4.3. Lower bounds on Negative Eigenvalues. We derive crude 
but useful lower bounds on the negative eigenvalues of M(£) by using 


the upper bounds from Section 4.2 together with the inclusion from 
Proposition [2j Suppose that n is odd (the case of interest for proving 
Theorem [I]). From Proposition [2] ([d]), the following spectral inclusion 
holds: 

Tl 

S(n, £, 0) C —S(2n, 2£, —). 

Thus bounds on the smallest eigenvalues of M(£) follow from bounds on 
the largest eigenvalues of 2 n x 2 n matrices with diagonal cos + 7r), 
0 < j < 2n — 1. First consider the case of £ = 1, the diagonal elements 
appear as in Figure [ 6 } A cyclic shift of the diagonal doesn’t change the 
spectrum so we may consider the 2 n x 2 n matrix with diagonal entries 

cos ^ 7r( - 2 ^ +1 - > j , 0 < j < 2 n — 1, as in Figure 7| Now, the same paths 

and bounds used in Section 42] can be used to get upper bounds. We 
will not repeat the details but merely state the conclusions. 

Proposition 5. There is a positive constant 6 such that for all odd n 
and 1 < £ < lo ^ , the smallest eigenvalue /?*(£) o/M(£) satisfies 


/ 3 *(£) — 1 + O 


iff 

n J 
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FIGURE 7. A shift of the diagonal elements for £ = 1 

Proposition 6. For < £ < §, &(£) > -1 + § (J) 2 (1 + o(l)) 

Remark. The case of even n arises naturally as well. This may be 
treated in parallel fashion using Proposition [2] (Jc]) to reduce things to 
bound the largest eigenvalues of slightly shifted matrices as above. 
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